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Abstract 

We show that 2-tag systems efficiently simulate Turing machines. As 
a corollary we find that the small universal Turing machines of Rogozhin, 
Minsky and others simulate Turing machines in polynomial time. This 
is an exponential improvement on the previously known simulation time 
overhead and improves a forty year old result in the area of small universal 
Turing machines. 



1 Introduction 

It has been an open question for forty years as to whether the smallest known 
universal Turing machines (UTMs) are efficient simulators of Turing machines. 
This question is intimately related to a problem regarding the computational 
complexity of 2-tag systems. 

Shannon [21] was the first to consider the question of finding the small- 
est possible UTM, where size is the number of states and symbols. Early at- 
tempts [51 [27J gave small UTMs that efficiently (in polynomial time) simulate 
Turing machines. 

In the early 1960s Cocke and Minsky : 2[ showed that 2-tag systems simulate 
Turing machines, but in an exponentially slow fashion. Minsky [TS] found a 
small 7-state, 4-symbol UTM that simulates 2-tag systems in polynomial time. 
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Figure 1: State-symbol plot of small UTMs. The plot shows the polynomial 
time curve induced by our previous UTMs |17j . the exponential time curve of 
Minsky, Rogozhin and others [TJ [5J [THl 123] , and the non- universal curve for which 
there are no UTMs [3 [7l [T8J [19] . The present paper improves the polynomial 
time curve so that it coincides with the previous exponential time curve. Our 
result shows that a polynomial time UTM exists for each state-symbol pair that 
is on, above, and to the right of the new polynomial time curve. 



So this small UTM simulates Turing machines via the following sequence of 
simulations 

Turing machine i— > 2-tag system i— > small UTM (1) 

where A i— » B denotes that A is simulated by B. Later Rogozhin [23] and 
others [U [8j [22] used Minsky's technique of simulation via (JTJ) to find small 
UTMs for a range of state-symbol pairs (see Figure [1]). All of these small UTMs 
efficiently simulate 2-tag systems. However since the 2-tag simulation of Turing 
machines is exponentially slow it has remained an open problem as to whether 
these UTMs can be made to run in polynomial time. 

For our result we replace Jl} with the following sequence of simulations 

Turing machine i— > cyclic tag system 
h-+ 2-tag system i— > small UTM 

Specifically, in this paper we show that 2-tag systems efficiently simulate cyclic 
tag systems. In a recent paper [16] we have shown that cyclic tag systems effi- 
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ciently simulate Turing machines. Thus the present paper provides an important 
piece of the puzzle as far as the computational complexity of small UTMs and 
2-tag systems are concerned. Our main results states: 

Theorem 1 Given a single tape deterministic Turing machine M that com- 
putes in time t then there is a 2-tag system Tm that simulates the computation 
of M and computes in polynomial time 0(t 4 (log t) 2 ). 

This immediately gives the following interesting result. 

Corollary 1 The small UTMs of Minsky, Rogozhin and others J2 [23 H3 \M$ 
are polynomial time, (3(i 8 (log£) 4 ), simulators of Turing machines. 

Before our result it was entirely plausible that there was an exponential 
trade-off between UTM program size complexity, and time/space complexity; 
the smallest UTMs seemed to be exponentially slow. However our result shows 
there is currently little evidence for such a claim. 

Early examples of efficient small UTMs were found by Ikeno and Watan- 
abe [51 [57]. Prior to the present paper the smallest known polynomial time 
UTMs were to be found in [T7]. However these efficient machines are not as 
small as those of Rogozhin et al., hence the present paper represents a signif- 
icant size improvement when considering small polynomial time UTMs. This 
improvement is illustrated in Figure [TJ 

There are numerous other applications of Theorem [TJ The technique of 
simulation via 2-tag systems is at the core of many results in the broad survey 
by Margenstern [14] ; our result exponentially improves the time overheads in 
many [101 HU H21 H31 [21] of these constructions. Another example of the use 
of small efficient UTMs is given by Levin and Venkatesan [9J [25] who used the 
small 8-state, 5-symbol, polynomial time UTM of Watanabe's [27] to show the 
average case NP-completeness of a graph colouring problem. 

In the present paper, the phrase "small UTMs" refers to Turing machines 
that obey the standard definitions. Recently Cook [3] has found universal ma- 
chines that are significantly smaller than those discussed in the present paper. 
Cook's machines simulate the cellular automaton Rule 110, which Cook showed 
to be universal via an impressive simulation. However these machines are gener- 
alisations of standard Turing machines: their blank tape consists of an infinitely 
repeated word to the left and another to the right. Intuitively, this change of 
definition makes quite a difference, especially since Cook encodes a (possibly 
universal) program in one of these repeated words. It should be noted that 
Cook's UTMs are exponentially slow; in a recent paper [16j we have improved 
their simulation time overhead to polynomial. 

1.1 Preliminaries 

Let N = {0, 1,2,.. .}, £ denote a finite alphabet, and e denote the empty word. 
The function | | : S* x E* — > N is written as ||wi,i«2|| and gives the number 
of occurrences of the word u>2 in ui\. For example 1 1 1 101001 , 1 1 1 = 2. All 
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logs are in base 2. Throughout the paper a and # are constant symbols, and 
x, Xi G {0, 1}. We define the parity of a word to be odd if the read (leftmost) 
symbol is undotted and even if the read symbol is dotted. 

2 2-tag systems 

Tag systems where introducted by Post [2U], and 2-tag systems were shown to 
be universal by Cocke and Minsky [2]. 

Definition 1 (2-tag system) A tag system consists of a finite alphabet of 
symbols E, a finite set of rules R : £ — > E* and a deletion number ft G N, 
^ 1. For a 2-tag system (3 = 2. 

The 2-tag systems we consider are deterministic (or equivalently, monogenic [4, 
I2T)]). The computation of a 2-tag system acts on a dataword w = o\02 ■ ■ ■ <tj. 
The entire configuration is given by u>. In a computation step, the symbols u\G% 
are deleted and if there is a rule for o\ , i.e. a rule of the form o\ — » cr; + i . . . cr/+ c , 
then the word . . . cr/ +c is appended. We write w\ h W2 when the dataword 
(configuration) W2 is obtained from wi via a single computation step: 

0-1(72(73 ... <7; h cr 3 . . . (7/0- ;+ i . . . (T/ +c 

where tri — > . . . cr/ +c G i?. A 2-tag system completes its computation 
if (i) \w\ < P, or (ii) it enters a repeating sequence of configurations, or (iii) there 
is no rule for the leftmost symbol. The complexity measures of time and space 
are defined in the obvious way. Given a word w, we use the term round to 
describe [|w|/2] computation steps that traverse w exactly once [41 126] . 

We often write 2-tag symbols in pairs. The second (even numbered) symbol 
is dotted to distinguish it from the first. In the sequel we encode binary symbols 
in the following way, 1 is encoded as 11 and as 00. Also a single pair of 
symbols is distinguished by being 'barred': 00 or II. So an example encoding 
of the word 11010 is IiliOOliOO. 

Lemma 1 Let w = xqXqX\X2 ■ ■ ■ xi G {00, 1 1 }{0, 1}*. Then there is a 2-tag 
system T that tests whether \w\ is odd or even in exactly \_\w\/2\ + 1 timesteps. 

Proof The 2-tag system has 6 rules, R = {x — > } where 

x G {0, 1}. Initially T reads the leftmost symbol xq of w. After one round, 
if the read symbol is dotted then the output is the single symbol x signifying 
that \w\ is odd. Otherwise the output is x signifying that |iu| is even. | 

Despite its simplicity, the proof idea of Lemma [T] constitutes one of the 
main ingredients of Cocke and Minsky's (exponentially slow) 2-tag simulation 
of Turing machines [2]. The 2-tag system dataword encodes an arbitrary TM 
configuration as two unary numbers. The left side of the tape is encoded as 
one unary number, the right side as another. Their simulation makes use of 
repeated tests for oddness and evenness of dataword length. Also doubling and 
halving of dataword length is used to read and write to the simulated tape. 
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3 Cyclic tag systems 



Cyclic tag systems were introduced by Cook [3]. 

Definition 2 (cyclic tag system |3j) A cyclic tag system C = cto, a±, . . . , 
is a list of binary words a m E {0, 1}* called appendants. 

A configuration of a cyclic tag system consists of (i) a marker that points to 
a single appendant a m in C, and (ii) a dataword io = xqX\ . . .Xi E {0,1}*. 
Intuitively the list C is a program with the marker pointing to instruction a m . 
At the initial configuration the marker points to appendant cto and w is the 
binary input word. 

Definition 3 (computation step of a cyclic tag system) A computation step 
is deterministic and acts on a configuration in one of two ways: 

• If Xo = then xq is deleted and the marker moves to appendant ar m+ i) mo( j p . 

• If Xq = 1 then xq is deleted, the word a m is appended onto the right end 
of w, and the marker moves to appendant a.( m +\) modp- 

A cyclic tag system completes its computation if (i) the dataword is the 
empty word, or (ii) it enters a repeating sequence of configurations. The com- 
plexity measures of time and space are defined in the obvious way. 

Example (cyclic tag system computation) Let C — 00,010,11 be a cyclic tag 
system with input word 011. Below we give the first four steps of the compu- 
tation. In each configuration C is given on the left with the marked appendant 
highlighted in bold font. 

00,010,11 011 h 00,010,11 11 
h 00,010,11 1010 h 00,010,11 01011 
h 00,010,11 1011 h ... 

We write an arbitrary single step of a cyclic tag system computation as 

ota, ■ ■ ■ , a m _i,a m , a m +i, ■ ■ ■ , «p-i xqX\...xi 
V- ao,...,a ra ,a m+ i,a m+2 ,...,a p _i x\. . . x%xi + \. . . Xi +C 

where x E {0,1}, and as usual if xq = then x;+i . . . xi+ c = e, otherwise 
if x = 1 then x i+1 . . . xi +c = a m E {0, 1} C , c E N. 

Cook [3] used the universality of cyclic tag systems to show that Rule 110, 
a binary one-dimensional cellular automaton, is universal. Recently we have 
improved on Cook's work by showing that cyclic tag systems simulate Turing 
machines in polynomial time: 

Theorem 2 ( [16j ) Let M be a single-tape deterministic Turing machine that 
computes in time t. Then there is a cyclic tag system Cm that simulates the 
computation of M in time 0(t 3 logt). 

In order to calculate this upper bound we substitute space bounds for time 
bounds whenever possible in the analysis. 



5 



4 2-tag systems efficiently simulate cyclic tag 
systems 

In this section we prove Theorem [TJ 
4.1 Encoding 

Cyclic tag systems use a binary alphabet and program control is determined by 
the read symbol and the value of the program instruction marker. On the one 
hand 2-tag systems seem more general than cyclic tag systems as an arbitrary 
constant (independent of input length) sized alphabet is permitted. On the other 
hand 2-tag systems seem more restricted in that program control is determined 
solely by the read symbol. 

Because of this restriction we use a large number of symbols in our construc- 
tion. The number of such symbols is a constant that is independent of input 
length, but is dependent on our simulation algorithm and the size of the sim- 
ulated cyclic tag system program. In our encoding we decorate symbols with 
dots (x,x), bars (x) and under- indexes (x). These decorations are used for 
algorithm control flow. 

Definition 4 (2-tag input encoding) The cyclic tag system input data-word 
w = x$xi . . . x n E {0, 1}* is encoded as the 2-tag dataword 

w = xqXq Xi&i X2X2 ■ ■ ■ x n x n ah ah ... ah 

where the number of ah pairs in w is \\w, ah\\ = 2^ log ( n+1 ^ and the extra whites- 
pace between symbol pairs is for human readability purposes only. 

This encoding is computable in logspace. The subword aa ah ... aa is used 
as a counter and its value \\w, aa\\ is used extensively in our algorithms below. 

An arbitrary (not necessarily input) cyclic dataword is encoded similarly to 
Definition [4] except that the counter is 'embedded' in w. Specifically if w = 
xqXi . . . xi then 

w — XqXq X\X\ . . . Xiii aa . . .ah Xi + i±i + i . . . xiii (3) 

3333 3333 3 3 3 3 3 3 

for some i £ {0, ■ ■ ■ ,1} and j S N. Furthermore the counter value is some power 
of 2 strictly greater than I, that is 

\\w,aa\\ = 2r iog ''l (4) 

where I' E N, I' > I. 

In our simulation the counter does not grow too large. As will be shown, 
the counter value is never more than double the longest dataword length that 
occurs in the simulated cyclic tag system computation. 
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4.2 The simulation 



We wish to show that there is a 2-tag system that simulates an arbitrary single 
step of a cyclic tag system computation, as defined in Equation @ . We decom- 
pose Equation into three conceptual steps: (i) if Xq = 1 then simulate the 
rule xq — > a rn by appending a m — xi+i . . .xi+ c , (ii) set x\ to be the new read 
symbol and delete xq, (iii) increment the program marker m so that the next 
appendant is a {m+1 mod p y 

We begin by giving a simulation of (ii) . Informally speaking, Lemma [2] 
states that there is a 2-tag system that (efficiently) moves the 'bar' forward by 
one symbol pair. The main difficulty is in distinguishing X\Xi from the other 
unbarred symbol pairs £2^2, ■ • ■ ,XiXi- This Lemma is stated for the case that 
the counter value is the next power of two greater than the encoded dataword 
length. 

Lemma 2 Given a word of the form 



in time 0(1 log I). Here \\w,aa\\ = ||*,ad|| = 2r i °s('+ 1 )l and i G {0, ...,/}. 

Proof idea There are 5 stages to the 2-tag algorithm. We let wq = w and 
let Wk denote the output of the fc th iteration of the 5 stages. 

In Stages 1 to 3 of iteration k we compute |~||z£>/c_i, by marking every 

second pair of xx symbols (we mark the even numbered pairs). Then in Stages 
4 and 5 we halve ||u?fc_i, aa\\ by marking every second pair of ad symbols (again 
we mark the even numbered pairs). We then return to Stage 1 and iterate 
until ||t&fc,od|| = 1: the counter has an odd value for the first time and we 
detect this. The number of fully completed iterations, and final value for k, 
is k = log \\w, aa\\. At this point xiii is the only pair of unmarked xx symbols 
in w, and so Xx&i is isolated (unique) from the other symbol pairs in ii. We 
delete xqXq. The uniqueness of x%xi enables the rule x\ — > x\Xi to be executed 
successfully. 

Proof details As usual let x € {0, 1}. Here we specify 2-tag rules, and take 
the reader through a single (the first) iteration of these rules, for illustration 
purposes we choose i = I. 

In Stage 1 we begin with a word of the form 



w = XqXq X\X\ . . . XiXi aa . . .aa Xi + iXi+i . . . xixi 



then there is a 2-tag system T that computes 




aa x i+1 x i+1 . ..xm 

6 6 6 6 6 6 



XqXq x\X\ X2X2 ■ ■ ■ x\Xi aa aa . . . aa 



1111 11 111111 11 



Stage 1 consists of the rules: 



1 oo'i ion' 



1 2 2 1 2 2 2 



a 



aa, 



2 2 
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as well as a few more rules that are given below. After one round we have 

XqXq Xiiiii X2X2X2 ■ ■ ■ X1X1X1 ad ad ... ad (5) 

2 U 2 U 2 1 2 1 2 1 2 Z 2 Z 2 Z 2 1 2 1 2 i 22 22 22 W 

The Stage 2 rules are 

{nn t nn nn nn t nn nn nn v .-/i .-p nn \ nf* v nh nh nn k nh nh 

2 3 3 ' 2 3 3 '2 3T' 2 ft' t £#> 

a — > ad, a — > ad, d — > dd\ $ — > dd\ 

2 33 2 33 2 33 2 3 3 

Continuing from ([5]), after one round we see that every second (even numbered) 
pair of xx is marked 

f of o f if i gaga fsfs ■ ■ ■ Jj-ifi-i gigl aaaa...aa 

where (for illustration purposes only) we assume that / is even. 
The Stage 3 rules are: 



{x — * xx, x — » xx, & — > riot, a — > ad, d — > 

L 3 44 3 44 3 44 3 44 3 

x — > #xx, x —> xx, $ — y ii, ix — > ad, d - 

3 44 3 44 3 4 4 3 44 3 

We enter Stage 3 by reading cither a dotted symbol (there was an odd number of 
unmarked xx pairs in Stage 1) or undotted symbol (there was an even number 
of unmarked xx pairs in Stage 1). Stage 3 begins by checking this (in one step); 
if the parity is even, that is the 2-tag system is reading dotted symbols, then a # 
symbol is appended to restore the parity to odd after one round. On completion 
of Stage 3 we are reading an undotted symbol: 

inin x\i\ i2$2 xziz $a}a ■ ■ ■ xi-i&i-i ad ad ... ad 

4 U 4 U 4 1 4 1 4°4° 5*5^ 4 ' 1 4 ' 1 Ft' 4 4 4 4 44 

In Stages 4 and 5 of iteration k we halve the value of the counter (we compute 
||u)fc,aa|| = \\wk~i, ad||/2), in a similar fashion to Stages 1 to 3. The Stage 4 
rules are 

{x ii.i — ► xx, £ — y qtgt , a — > add, d — > dd\ 

L 4 55 4 5 5 4 55 ' 4 5 5 5 ' 4 5S J 

Which after one round gives 

XqXq Xi±i $2$2 ■ ■ ■ iiii add add . . . add 

55 55 55 55 555555 555 

The Stage 5 rules then halve the counter value: 

J rn y rn nf* nn ^ nn nn nh ^ nh nf* nn i nn nn nh ^ nh nh 

L 5 115 U S U S 115 11 

a -> aa,a ->• dii,a -y e,d^ -> dd} 
Continuing our computation we get: 

x of o xxxx f 2 fi ■ • ■ gzg i ad ddaddd... ad dd 
which switches control back to Stage 1. 
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Each iteration of the 5 stages halves the counter value. After log \\w, aa\\ 
iterations the counter has value 1, this causes the output from Stage 4 to be 
of odd length for the first time. This in turn switches parity to even (dotted 
symbols) during Stage 5, which is detected at the beginning of Stage 1, by the 
rules: 

\x — * #, x — > xx, i — > xx, a — > ad, <L — > ad} 

k 1 1 661 661 661 66 J 

The first of these rules deletes x$ in one step. The second of these rules passes 
the bar forward by one symbol pair, while 2 of the others unmark the remaining 
symbols. 

x\X\ X2&2 X3X3 • ■ ■ xi&i ah ah ... ad 

6 6 6 6 6 6 6 6 66 66 66 

The # symbol restores the parity to odd so that we read undotted symbols (in 
subsequent computations in this paper). 

For our example iteration we chose i = I. If the dataword w is in the more 
general form i £ {0, . . . , 1} given by the lemma statement then the same proof 
holds; our rules are such that embedding the counter inside the dataword does 
not affect parity in a way that would change the algorithm control flow. | 

The following lemma provides much of the mechanics required for simulation 
of the appending of a cyclic tag system appendant (point (i) from the introduc- 
tory paragraph of Section |4~2"| . Simulating the appending is straightforward, 
the main work is in maintaining the equality in Equation Thus, after ap- 
pending, if the simulated dataword has length that is at least the counter value 
then we double the counter to satisfy Equation (j4|). The Lemma is stated for 
the case that the counter value is initially the next power of two greater than 
the encoded dataword length. 

Lemma 3 Given a word of the form 

w = 11 Xiii . . . XiXi ah ah ... ah Xi+i&i+i . . . Xiii 

00 0^0^ D O 

where \\w,ah\\ — 2^ iog ( l+1 ^ and i G {0,...,/}, then there is a 2-tag system T 
that computes 

™ = \l I !? 1 • ■ ' ff ■ ' ■ fi+ifi+i ■ ■ ■ fifi fi+ifi+i • ■ ■ ^i+cii+c 

where c 2r iog (' +1 )l , ||t&,aa|| = 2^ og( - l+c+1 ^ . T completes this computation in 
time 0(1 log/). 

Proof By applying the rule 

U ~* o i+i ■ ■ ■ o i+c: s i + c W 

to w we get a word denoted Wq that is of a similar form to w except that 
||w;o,ad|| = 2^° s ^ l+1 ^ , i.e. the counter has not yet been updated to the correct 
value of ||w),ad|| — 2r iog ( i+c+1 )l . The remainder of the proof is concerned with 
updating the counter. 
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We let u>k denote the output of the fc th iteration of the 4 stages. The rules 
for Stages 1 to 4 are of a similar flavour to those used in the proof of Lemma [2j 
so we omit them in favour of a brief overview. 

We begin by computing on wq. During Stages 1 and 2 of iteration fc, we 
compute \\wk, aa\\ = \\uik-i, aa\\/2, by marking every second pair of ad symbols 
(here we mark the even numbered pairs). Then in Stages 3 and 4 we compute 



\Wk, XX 



\\Wk-l,X±\\ 



(0) 



by marking every second pair of xx symbols (here we mark the odd numbered 
pairs). We then return to Stage 1 and iterate until ||t£>fe, ad|| = 1: the counter 
now has an odd value (for the first time) and we detect this in Stage 3. The 
number of fully completed iterations, and final value for fc, is fc = log \\w, ad\\ — 
\\og(l + 1)]. An additional stage restores the parity (by introducing an extra # 
symbol and then deleting it after one round) so that we are reading undotted 
symbols. Then iik is of the form 

Wk = 11 • ■ ■ ad ^| . . . di . . . ^;|; . . . ,^ +c |; +c 

At this point the number ||w?fe,a;i;|| of unmarked xx pairs satisfies ^ 
\\w k ,xx\\ < 1. To see this, note that ||t2> ,ad|| = 2l 1 °g( i + 1 )l and c s$ 2^ l+1 ^ . 
Solving Equation (J6j) for fc = \\og{l + 1)1 gives iff 

==: [[ifl ,a:a:|| = l + c < 2^ os( - l+1 ^ 

and 1 iff 

2 nog(i+i)i ^ 11^,^11 = / + c<2 riog(i+c+Di 

There are no other possible values for ||wo, si || thus we have only to check 
whether \\vSk, xx\\ is or 1. To prepare, in two consecutive rounds we apply the 
rules 



{1 — » 11, x — ► xxx, & —> ijt, a — v ad, d — > adl 

L 5 66 5 666 5 66 5 66 5 66 J 



and 



\ 1 ^ 1 1 • ~ ^ tZ/ *JJ > *JJ ^ ^ . tlj ^ 00 fJC * 30 ^ CO 00 - 

L 6 77 6 77 6 6 77 6 77 

a — ► ad, a — ► ad, d — » ad, d — > adl 

6 77 6 77 6 77 6 77 J 

These rules have the effect of shifting the parity of the read head to dotted 
symbols iff xx\ = 1. In addition these rules unmark all marked symbols, 
as the marks are not needed below. 
We have two cases: 

Case 1: If = we do not need to change the counter value in order 

to satisfy Equation (HI). In this case we detect ||i<jfc, xo; || = by reading the 
undotted, barred symbol 1 . To complete the computation we apply the rules 

{I — > 11, x — > xx, a — > ad) 

7 88 7 88 7 88 
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Case 2: If ||wfe,xa;|| = 1 we double the counter to satisfy Equation In this 
case we detect ||iUfc,a:£|| = 1 by reading the dotted, barred symbol 1. We then 
restore odd parity and double the counter value, by applying the rules 

{1 — > Sll.x — > xx, a — > ad ad} 

L 7 88 7 88 7 8888 J 

I 

4.3 Proof of main result 

Theorem 3 Given a cyclic tag system C that computes in time t(n) on input 
of length n, where n is at least the length of C 's longest appendant, then there 
is a 2-tag system Tq that simulates the computation of C and computes in 
time 0(t 2 (n) logi(rc)). 

Proof As stated, it is assumed that C's input length n is at least that of its 
longest appendant. As part of the 2-tag input encoding in Definition 21 the 
constant number of shorter inputs are assumed to be padded to this length. 

Recall, from the beginning of Section POI the decomposition of a single cyclic 
tag computation step into the conceptual steps (i), (ii) and (iii). 

Lemma [3] provides the algorithm for step (i) for the case that x = I. For 
the other case of x = we skip (i). Deciding between the two cases is easily 
implemented by setting the parity to even iff Xq = 1. 

Lemma [5] provides the main mechanics for step (ii). After applying Lemma[5] 
the barred pair xx is either on the left of the counter (i > in the lemma 
statement), or else has jumped over the counter and is now on the right (i = 0). 
Lemma [3] assumes the head is on the left. Therefore if the head is on the right 
we want to move it to the left so that we can apply Lemma [3] in subsequent 
iterations. Therefore as a second part of step (ii) we do the following. In one 
round we read the entire dataword w, and reproduce w exactly except that the 
underindex is incremented and the length of the barred subword is increased 
from 2 to 3 by a rule of the form x — > xxx. On the one hand, if in the next 
round we read dotted counter symbols (d) then the head is to the left of the 
counter; then in one more round we set the parity to odd and we're done. On 
the other hand, if we read undotted counter symbols (a) then we set the parity 
to odd in one round and furthermore we read the counter an extra time to place 
it to the right of the head. 

Lemata [2] and [3] state that the counter value should be the next power of 
two greater than C's dataword length I. It is not difficult to check that their 
proofs also hold for the more general case where the counter value is any power 
of two greater than I. Therefore these proofs are still applicable if C's dataword 
drastically decreases in length. In this case the big-Oh time bounds in these 
proofs depend on the counter value rather than I. But since the counter value 
is 0(t(n)) this does not affect our overall complexity analysis. 

For step (iii) we introduce a new decoration for 2-tag symbols. So far, the 
number q of distinct 2-tag symbols that we have used is dependent on our 
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algorithm. We now increase this number to pq where, as usual, p is the number 
of appendants of C. We create a new symbol set by decorating each 2-tag symbol 
y £ {x, x~,x,x,x,yl:,jt;,a,a,a,$,$,, jf\ with an integer m for all ^ m < p. Using 
this, our encoding of an arbitrary cyclic tag system configuration is of the form 

XqXq Xi±i . . . Xiii aa aa . . . aa Xi + \x i+ i . . . x\X\ 

3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 

Steps (i) and (ii) are simulated, while ignoring the value of m. (Note that 
our 2-tag algorithms are easily concatenated by having appropriate j values at 
the beginning and end of each algorithm.) Then j is given a value that signals 
the completion of steps (i) and (ii) . Then step (iii) (incrementing the program 
marker) is simulated by rules of the form 

{x — > x x, x ^ x x, a — > aa} 

3 3 3 3 3 3 3 3 3 

where ml — (m + 1) mod p. Applying these rules for a given k takes only one 
round, or 0(1) timesteps. 

In the time analysis of the computation of Tc note that for an arbitrary 
timestep of C we have I — 0(t(n)). Therefore via Lemmas [3] and [H and the 
present proof, Tc simulates a single step of C's computation in time 
0(t(n) log t(n)). | 

We get the proof of Theorem[TJ our main result, by combining the statements 
of Theorems[2]and[3] If we combine these Theorems directly we get a time bound 
that is higher than that of Theorem [1] To get our tighter bound a more careful 
analysis is required where we substitute space bounds for time bounds whenever 
possible in the analysis. 
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